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[0001] A SINGLE USER DETECTION USER EQUIPMENT 

[0002] This application is a continuation of U.S. Patent Application No. 09/814,346, 
filed March 22, 2001, which claims priority to U.S. Provisional Patent Application No. 
60/266,932, filed February 6, 2001 and U.S. Provisional Patent AppHcation No. 60/268,587, 
filed February 15,2001. 

[0003] BACKGROUND 

[0004] The invention generally relates to wireless communication systems. In 
^ particular, the invention relates to data detection in a wireless communication system. 
^ [0005] Figure 1 is an illustration of a wkeless communication system 10. The 

communication system 10 has base stations 12, to I25 which conraiunicate with user 
P equipments (UEs) 14, to I43. Each base station 12, has an associated operational area, 

s where it conmiunicates with UEs 14, to 1 43 in its operational area. 

O 

pj [0006] In some communication systems, such as code division multiple access 

Li 

(CDMA) and time division duplex using code division multiple access (TDD/CDMA), 
multiple conmiunications are sent over the same frequency spectrum. These 
communications are differentiated by their channelization codes. To more efficiently use 
the frequency spectrum, TDD/CDMA communication systems use r epeating frames divided 
into rime slots for communication. A communication sent in such a system will have one 
or multiple associated codes and time slots assigned to it. The use of ^e code in one time 
slpt| s referred to as a resource unit . 

[0007] Since multiple conmiunications may be sent in the same frequency spectrum 
and at the same time, a receiver in such a system must distinguish between the multiple 
communications. One approach to detecting such signals is multiuser detection. In 
mu ltiuser detection, signals associated with all the UEs 14, to I43 users, are detected^ 
simultaneously. Approaches for implementing multiuser detection include block hnear 
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equalization based joint detection (BLE-JD) using a Cholesky or an approximate Cholesky 

deCOuipOSitiOll . 

[0008] Another approach is single user detection. In single user detection, data is only 
recovered for a single user (one UE 14i). Based on the application, the single user detected 
data may have been sent using one or multiple codes. Approaches for implementing single 
user detection include block linear equalization using a Cholesky or an approximate 
Cholesky decomposition. These approaches have a high complexity. The high complexity 
leads to increased power consumption, which at the UE 14^ results in reduced battery life. 
Accordingly, it is desirable to have altemate approaches to detecting received data. 

O 

O [0009] SUMMARY 
NJ 

M [0010] A time division duplex using co de div ision multiple access user equipment 
yi — — 

□ r eceives a plurality of data si gn als in a time slot . E ach da ta signal experiences a si milar 
g chan nel respon se^ The user equipment receives a combined signal over the shared spectrum 
^ in a time slot. The combined signal comprises the plurality of data signals. The combined 
^ signal is sampled ayi_^multipb of ajL^i ^^ The similar channel 



response is estimated. A channel jresponse madix or a channel correlati on matrix is 
contsructed based on in part the estimated channel response^ A s pread data vector is 
determi ned based on in partji^fast fourier transform fPFT) decompositio n of a circulant 
version of the chan nel re spojise^r channdj;jDnelMQn^^ The spread data vector is 
despread to recover data from the matrix. 

[001 1] BRIEF DESCRIPTION OF THE DRAWING(S) 

[0012] Figure 1 is a wireless communication system. 

[0013] Figure 2 is a simplified transmitter and a single user detection receiver. 
[0014] Figure 3 is an illustration of a conmiunication burst. 
[0015] Figure 4 is a flowchart of low complexity data detection. 
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[001 6] Figures 5- 1 5 are graphs of the performance of low complexity data detection. 

[0017] DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 
[0018] Figure 2 illustrates a simplified transmitter 26 and receiver 28 using low 
complexity data detection in a TDD/CDMA communication system. In a typical system, a 
transmitter 26 is in each UE 14^ to 14^ and multiple transmitting circuits 26 sending multiple 
communications are in each base station 12i to I25. The lqw__comple2dty_^^^^ 
receiver 28 may be at a base station 12^ . UEs 14j to 14^ or both. The receiver 28 can be used 
at a UE 14i for either njulliu&er or single-oiseiLdetection of a medium to high data rate 
service, suclLasaL2^megabits per second (Mhs). The receiver 28 can also be used at a base 
station 12„ when only a single UE 14i transmits in a time slot. 

[0019] The transmitter 26 sends data over a wireless radio channel 30. A data 
generator 32 in the transmitter 26 generates data to be communicated to the receiver 28. A 
modulation/spreading sequence insertion device 34 sp reads the data a nd makes the spread 
reference da ta time-multip lexed with a midamble training sequence in the appropriate 
^ assigned time slot and c odes for spread i ng the dat a, producing a communication burst or 
bursts. 

[0020] A typical conmiunication burst 16 has a midamble 20, a guard period 18 and 
two data bursts 22, 24, as shown in Figure 3. The m idamble 20 sep arates the two data bursts 
22, 24 and Jt he_guard period 18 separates the aQ mmuiilcatiojL_bursts to allow for th e 
difference in arrival times of burst s transmi tted from different transmitters 26. The two data 
bursts 22, 24 contain the communication burst's data. 

[0021] The communication burst(s) are modulated by a modul ator 36 to radio 
frequency (RF). An antenna 38 radiates the RF signal through the wireless radio channel 
30 to an antenna 40 of the receiver 28. The type of modulation used for the transmitted 
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communication can be any of those known to those skilled in the art, such as quadrature 

phase shift keying (QFSK) or an N-ary quadrature ampUtude modulation (QAM). 
[0022] The antenna 40 of the receiver 28 receives various radio frequency signals. 
The received signals are demodulated by a demodulator 42 to produce a baseband signal. 
The baseband signal is processed, such as by a channel estimation device 44 and a low 
complexity data detection device 46, in the time slo t and with the appropriate codes assigned 
to the received bursts. The channel estimation device 4 4 uses the midamble training 
sequence component in the baseband signa l to provide channel information, such as channel 
im pulse responses . The channel information is used by the^ d ata detection device 46 to 

p esti mate the transmitted data of the received conmiunication bursts a s hard symbols. 

5 [0023] The data detection device 46 uses the c hannel information provided bv the 



SI 

m 



channel estimation device 44 a nd the k nown spreadin gjcodes used by the transmitter 26 to 

^ estimate the data of the desired received communication burst(s). Low complexity data 

^ detection is explained in conjunction with the flowchart of Figure 4. Although low 

fU complexity data detection is explained using the third generation partnership project (3GPP) 

m universal terrestrial radio access (UTRA) TDD system as the underlying conmiunication 
O 

fy system, it is appUcable to other systems. That system is a direct sequence wideband CDMA 
(W-CDMA) system, where the upUnk and downlink transmissions are confined to mutually 
exclusive time slots. 

[0024] The receiver 28 receives using its antenna 40 a total o f K bursts that arrive 
simultaneously, 48. The ^bursts are superimposed on top of each other in one observation 
interval. Some or all of the AT bursts ma yjrise from or go to the same users for higher data 
rate service^. For the 3GPP UTRA TDD system, e ach data field of aJime slQ^^^^ 
to^ne observation interval. 

[0025] A k!^ burst of the K burst s uses a code of C^^^ of length Q chips t o spreade ach 
of its A^^ symbols to yield a sequence of length Q N ^ chips. The Id^ burst passes through a 




I-2-178.3US 

channel with a known or estimated channel response, h}^ ^ , of length W chips to form a chip 
sequence of length, N ^ = (SF • N ^ + W - \) . 5F is the spreading factor. Since uplink 
signals may originate from multiple UEs 14, to Uj, each h}^^ in the upUnk may be distinct. 
For the downlink in the absence of transmit diversity, all bursts pass through the same 
channel and have the same h}^^ . At the receiver 28, the bursts from all users arrive 

superimposed as a single received vector , r . Some or all of the K bursts may be part of a 

/ 

multi-code transmission. The multi-codes have the same h}'^\ because they originate from 
Q the same transmitter 26. / 

w [0026] The multi-user signal model consists of known received chips and K - N ^ 

^ ,/ ^ . 

^ unknown information bearing symbols. | The symbol response, , of the fc^^ buLSlis_flLe 

g convolution of C}^^ with h}^^ . Accordingly, is of length (5F+W-1) chips. Wis the .^a^^ 

o — ^ 

nj impulse response, which represents the trail of^Mpj_lefLby_^ The 

^ — ~ 

^ unknown symbols of the Jd^ burs t form a column vector d}^K r}^^ is the contribution of the 
k^^ burst to the overall received chip vector, r . d}^^ is the data vector for the k^ ^ burst. 



d}''^ and r}^^ are related by Equation 1.. 

(k) ^ ^(t)^(i) ^ ^here k = 1...K Equation 1 



[0027] k'*^ is the channel response matrix for the burst, which is an x N, matrix 
whose column is the symbol-response of the element of d}^K Assuming a time-invariant 

symbol-response, each column of A^*^ has the same support, s}^\ and successive columns 
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1 



are zero-padded and shifted versions of the first column. The overall, chip-rate, received 
vector is per Equation 2. 

K 

r = ^ r}^^ +n Equation 2 

1=1 

n is a zero-mean noise vector with independent identical distribution (i.i.d.) components of 
the variance, cr^ . Equation 2 becomes Equation 3, when written as a single matrix equation. 

y .1?^ y P=kd+n' Equations 

w — ' 

^ [0028] A is the Everall channel response matrix] which is a matrix of size N^xK -N,. 



a 



II 



0 is the data vector, which is a column vector of length K - N ^ . Equation 2 and Equation 
fU ~ — ■ 

3 model the inter-symbol interference (ISI) and multiple-access interference (MAI) in the 
5 received vector, r . 

[0029] /The signal models of Equations 1, 2 and 3 are form ulated for chip rate 
^mgling,_such as 3.84 Mega chips per second (Mcps) in 3GPP UTRA systemi|_For 
increased statistical accuracy, a receiver 2$^^^e over-sampli^^^^^^a multiple chip 
rate sampling. M. typical multiple chi p rate sampling js twice the chip rate, although other iji^A 
multiples mgy^e used. When using multiple chip rate sampling, the received signal burst 
willb e over-s a mpled generat in g multiple sampled sequence s. Ea ch sequence is sampled at 
the chip rate with different .time o ffsets with res pect to, one another. The kf'' burst passes 

through a channel with a known or estimated channel response, h]^ ^ , for the m"' sampled 
sequence, r^^ is the contribution of the A:** burst to the m'* overall sampled chip vector, r„ . 
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The data symbol vectors d}^^ and the m"' sampled chip vector r^^ are related by Equation 
4. 

lH^^ =All'^d}^\ k =l-- K, m=l..M Equation 4 

A^*^ is the symbol response matrix for the m"' sequence. It is a matrix of size N^xN 

whose column is the m'* sampled symbol-response of the element of d}''K 

[0030] Equation 5 is the overall, chip-rate, received vector, , of the m"' sampled 



Q sequence. 

SJ 
Si 

I 

O Lm=X-"^'''- m=l...M Equations 



^ [0031] For an Mmulti ple of chip rate samplin g; a sin g le matrix e xpres sion is p er 



o 



fU Eguation6. ; 

..l'^^'^ ^^^^^^ 



l^j^ r =A'd_+n Equation 6 



[0032] r is the received signal vector and is defined as per Equation 7. 
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r = 



Equation 7 



[0033] A ' is defined as per Equation 8. 



S 
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Equation 8 



fU 



1^ [0034] Equation 9 is Equation 6 rewritten as a summation form of K bursts. 



m 

□ 

m 



Equation 9 



[0035] Equation 9 can be rewritten as Equation 10. 



Equation 10 
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[0036] C is code sequence of the burst. H'^^^ is the channel response for the 
AT" sequence, which is defined for M multiple chip rate sampling per Equation 1 1 . 
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Equation 1 1 



[0037] When all the signal bursts in a time slot arise from the same user in the upUnk 
or go to the same user in the downlink, the bursts pass through the same propagation path 

and, accordingly, the same fading channel. As a result, is the same for all bursts 

=H'^^^ =H^ , for all itandj) and is replaced in Equation 10 with as per Equation 

^^^^ 

12. .^.^ 



Equation 12 



k=\ 



[0038] 



Equation 13 is Equation 12 rewritten as a single matrix expression. 



r_ =H;Cd_-hn 



Equation 13 
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[0039] 



C is the code matrix. For M chip rate sampling, is per Equation 14. 



Hex 

H 



c2 



H 



cM 



Equation 14 



Q 
O 
SI 
SJ 

m 



fU 

m 
p 



[0040] For an m'* chip rate sample, H^^ is the channel re sponse for the m"' sampled 
sequence. Each H^^, m = L..M , is determined by the channel estimation device 44, 50. 



The matrix structure of each f/^^ is per Equation 15, 52. 



*m,0 



'm,2 



*ra,0 



*m,l 



'm.2 





K,W-3 
















0 


0 



0 



0 



0 

Kx 

K.2 



0 



0 

K.i 



*m,r-3 



Equation 15 
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[004 1 ] The overall signal model of data detection is represented as per Equations 1 6 



J 1 ^ 
OiiU I / 



r -H^s^-^-n^ Equation 1 6 

l = Cd_ Equation 17 

[0042] I is the spread data chip vector. C is the code vector. One approach to 
^ determine s_ is to use a zero forcing (ZF) solution of Equation 16 as per Equation 18. 

1 = (h'^" H'" r_ Equation 18 — 




S 

O 

fU [0043] H'c" is the hermitian of H^' . Another approach is to use a minimum mean 

in 



fe" square error (MMSE) solution as per "Equation 



0^ 

5^ 



l = {Hf H',+G^l^^ H'^^ I Equation 19 



[0044] is the^noise variance. / is the identity matrix. After solving either 

Equation 17 or 18 for^ , the solution of Equation 17 is obtained by despreading, as 
represented by Equation 20,j56. 



Equation 20 
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[0045] The following approaches to solve Equations 1 8 and 1 9 for s use a fast fourier 

transform (FFT) decomposition of either a circulant approximation of the channel 
correlation matrix^^Rjo^^ channel response matrix, , ^54^Using eitiier mabix reauires 

an approximation; however, using the channel response matrix, H ^ , also requires truncation 

of the last W-1 rows of the matrix to make it square. Accordingly, to eUminate degradation 
due to truncation, the channel correlation matrix, /?, is preferably used. 
[0046] A FFT decomposition of the channel correlation matrix, R, is performed as 
follows. For a ZF approach, R is defined as per Equation 21 . 

r/h^H^ Xff i H„ Equation 21 

m=l 



O [0047] For a MMSE approach, R is defined as per Equation 22. 
fU 

m 

O R =H'" h; + o^I Equation 22 



[0048] The structure of the channel correlation matrix, R, is represented as per 
Equation 23. 
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Equation 23 



[0049] Equations 18 and 19 are rewritten in terms of R as per Equations 24 and 25, 
respectively. 




Equation 24 



Equation 25 



[0050] The matrix- vector multiplication Rs_ can be viewed as a linear combination 

of column vectors of the channel correlation matrix, /?, weighted by tiie corresponding 
elements of data chip vector s_ , as per Equation 26. 



-13- 



I-2-178.3US 




SF 



Equation 26 



[0051] 



g. is the i column of the channel correlation matrix R. s, is the i element 



of spread data chip vector 5 . 
[0052] [fiy modifying the structure of matrix optim um circulant matrix 



approximation ofchannel correlati on matrix^ , can be deterrnined using Equation 2' 
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Equation 27 



[0053] The first column, q , has the full non-zero elements without any truncation. 
The circulant matrix, R^.^ , is defined by its first column q . The first colunm q of circulant 
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matrix, /?^,, , is obtained by permuting the W* column of the channel correlation ma trix, 
/?, using the permutation operator or index vector as defined^ by Equation 2 8. 

p =[W:Af -eaiW -l] ^ ' 5 Equation 28 

[0054] Alternately, a circulant matrix is also defined by the column g ^ of 
channel correlation matrix, / ?. In general, any column greater than colunm may be used 



with a proper index vector (permutation vector). 

p [0055] This alternate approximate circulant channel correlation matrix, K[^^ , relates 
O 

"^-J to /?^,^ per Equation 29. 

o 

^ p). Equation 29 

£ 

Q 

J [0056] The advantage with this approach is that is used directly without 

permutation. However, the solved spread data chi p .vector £ is required to^be, inverse^ 
permuted by the index vector 'p as per Equation 30. 

[0057] By permuting the first ro w in the previous approach, the need for inverse 
permuting 5 is eliminated. 

p =[Af -ly +2 : -5^,1 : -SF +l] Equation 30 
[0058] Equation 31 is the FFT decomposition of matrix K^^^ . 
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R^i^ ==Dp^A^Dp Equation 31 

[0059] Dp is the P-point FFT matri x and A^j is diagonal matrix, whose diagonal is 

the FFT of the first column of matrix R^^^ . A^^ is defined as A^ =diag{Dpq) . 

[0060] Using a FFT decomposition of the channel response matrix, /// , is performed 

as follows. Matched filtering, H^" r , is represented by Equation 32. 

n 

H:"r_=^H,lu Equation 32 

in "■=' 
'J 

^ [006 1 ] The diannel re sponse mat rix that co rresponds to each.samp.led,s.e.q.uejiee.,^j„ , 

ij\ m =1,2,..., M , a re circulant matrixe s. Each matrix can be decom p^sedj nto three FFT 

O 

pj mattjjUDiLltiEiicatiQn as per Equation 33. 



=D;'A„^^Dp, m=\...M Equation 33 

[0062] As a result, the decomposition of the channel response matrix is per Equation 
34. 



H;" L =DJ' X A*,^^ D, r„ Equation 34 
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[0063] To recover the data chip vector s , Equation 35 is used. 

^ ' ? 

i = Ki":"r. =D,-'A-;2;A',__i),r, / Equation 35 

m=l ^3 

[0064] In the frequency domain. Equation 35 becomes Equation 36. 



Q F(s)=-^ Equation 36 

S Fiq) 
SI 

m 
o 

e [0065] <S) represents the operation of element by element multiplication. Using 
O 

rU Equation 36, F {s) is determined^ By taking the inverse transform of F {s) , the spread data 
H — . — ^ 

vector, s , is determined. If used for multi-user detection in the downlink or a single user 
Q - A, 

Kl ^^^^^ 

■ ' solely uses one time slot in the upUnk, i is despread by using all of the codes to recover the 

transmitted data d_ as soft symbols. If used for single user detection in the downlink, 5 is 

despread using that user's codes to recover that user's data as soft symbols. Hard decisions 
are made to convert the soft symbols to hard symbols. 

[0066] Two approaches to implement the FFT composition are a prime factor 
algorithm (PFA) and a radix-2 algorithm. Although a PFA is considered more efficient than 
a radix-2 algorithm when a non-power-of-two number of FFT points is used, the following 
complexity analysis is based on a radix-2 FFT implementation for simpUcity. The 
complexity based on radix-2 algorithm can be considered as the worst case. Additional 
improvement in complexity is obtainable when PFA is used. Zero-padding radix-2 FFT 
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implementation entails the zero-padding the first colunm of //^^ , m = 1... M , the vectors 
r^,m=l...M and q . The zero-padding makes their length equal to the nearest radix-2 

integer that is greater than or equal to the length of a data field. For example, the length of 
a data field is 976 chips for burst type 1 in a TDD burst specified by 3GPP W-CDMA 
standard. The nearest radix-2 integer of 976 is 1024 (P = 1024). P is the radix-2 integer. 
[0067] Four types of radix-2 FFT computations are required: Dpr„, Dp}±„, Dpg^ 

and — . Two of the computations are computed M times for all sampled sequences: 

□ Dnr„, m=l...M and Dph„, m=l..,M . The other two are performed only once for 
O 

^ the sampled sequences. Dphj„,m =1...M and D^^^ are computed once per time slot. 

m 

Q Dd(-) 

\Q Dpr_„,m=l...M , '^^ are computed twice per time slot. As a result, a total of 3(M + 1 ) 

b 

W radix-2 FFT computations are required. Each needs Plog2/* complex operations. By 

m assuming each complex operation requires four real operations, the complexity for radix-2 
□ 

fy FFT computations in terms of million real operations per second (MROPS) is per Equation 
37. 



C, =3(M -l-l)?log2?-4-100-10"* MROPS Equation37 

[0068] For the complexity of the vector multiplications, there are M element-to- 
element vector multiplications and one element-to-element vector division, which are 
performed twice per time slot. As a result, the complexity for the vector operations in terms 
of MROPS is per Equation 38. 
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=2(M +1)P-4 100-10~* MROPS Equation38 

[0069] For the complexity of calculating the vector q , it requires MW^ complex 

operations, which are performed once per time slot. The complexity in terms of MROPS is 
per Equation 39. 

C3 =MH^^ •4 100 10"*^ MROPS Equation 39 

^ [0070] The total complexity except for the despreading in MROPS is per iEquation 
O 

O 40. 



O C,ft=C, +C2+C3 MROPS Equation 40 

h 

P [0071] Despreading is performed twice per time slot. The complexity of despreading 

W in terms of MROPS is per Equation 41 . 

O 

ru 

= 2'K -N -Q-A-m ■10~^ MROPS Equation 41 



[0072] As a result, the total complexity of the data detection including despreading 
is per Equations 42 or 43. 

Crotai =Cff,+ Crf„p MROPS Equation 42 



Cj^^^l=[3iM +l)P\o%2P + UM +l)P+ MW"^ +2KNQ] 4 m iO ^ MROPS 

Equation 43 
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[0073] The following tables show the complexity in MROPS for a 1 024-point radix-2 
(P = 1024) computation. Complexity is shown in Tables 1 at the chip rate and at Table 2 
at twice the chip rate sampling. A complexity comparison is made in MROPS between 
BLE-JD using approximate Cholesky decomposition and low complexity data detection, as 
shown in Tables 3 and 4. Table 5 is a complexity comparison showing the complexity of 
low complexity data detection as a percentage of the complexity of BLE-JD using 
approximate Cholesky decomposition. As shown, low complexity data detection has a much 
lower complexity than approximate Cholesky based BLE-JD. Depending on the number of 
bursts transmitted and spreading factors, for most cases, low complexity data detection is 
25% at the chip rate, and 30% at twice the chip rate, of the complexity of approximate 
Cholesky based BLE-JD. 

[0074] Table 1 . MROPS of a full-burst using low complexity data detection for burst 
type 1 at chip rate sampling. 
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Funcs executed twice per half- 
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[0075] 



Table 2. MROPS of a full-burst using low complexity data detection for burst 



type 1 and twice Irie chip rate sampiing. 





Funcs Executed once per burst 


Funcs executed twice per half- 
burst 


#of 
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Construct 


Compute 

m =1...M 

Via 
Radix-2 
FFT 


Compute 
Via 
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FFT 
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Total 


1 


2.6 


8.2 


8.2 


16.4 


16.4 


0.78 


52.6 


8 


2.6 


8.2 


8.2 


16.4 


16.4 


6.25 


58.1 


12 


2.6 


8.2 


8.2 


16.4 


16.4 


9.4 


61.2 


13 


2.6 


8.2 


8.2 


16.4 


16.4 


10.1 


61.9 


14 


2.6 


8.2 


8.2 


16.4 


16.4 


10.9 


62.7 


16 


2.6 


8.2 


8.2 


16.4 


16.4 


12.5 


64.3 



SI 

m 

s 

□ 
m 

□ 
ry 



[0076] Table 3. Comparison in MROPS between BLE-JD (approximate Cholesky 
decomposition) and low complexity data detection at chip rate sampling. 



Spreading Factor, Q 


# of bursts, 
K 


Proposed algorithm 


BLE-JD 


1 


1 


26.7 


318.2 


16 


8 


32.2 


81.1 




12 


35.3 


174.6 




13 


36 


205.5 




14 


36.8 


239.4 




16 


38.4 


318.2 
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[0077] Table 4. Comparison in MROPS between BLE-JD (approximate Cholesky 
decomposition) and low complexit>' data detection at twice the chip rate sampling. 



^nrpaHincy Partor O 


# nf hiir.sts 
K 


Pronosed algorithm 


BLE-JD 


1 


1 


52.6 


427.6 


16 


8 


58.1 


124.8 




12 


61.2 


248.3 




13 


61.9 


287.7 




14 


62.7 


330.4 




16 


64.3 


427.6 



[0078] Table 5 . Complexity of FFT of the channel correlation matrix as a percentage 
of the complexity of approximate Cholesky based BLE-JD. Approximate Cholesky based 
BLE-JD is set at 100% complexity. 



spreading Factor, Q 


# of bursts, K 


Chip rate sampling 


Twice the chip rate 
sampling 


1 


1 


8% 


12% 


16 


8 


39% 


47% 




12 


20% 


25% 




13 


18% 


22% 




14 


15% 


19% 




16 


12% 


15% 



[0079] Figures 5- 1 5 are graphs of the performance of low complexity data detection. 
Two high date rate services are simulated. One is single-code transmission with SF = 1 and 
the other is multi-code transmission with twelve codes and spreading factor 16 for each. 
Low complexity data detection is tested under various delay spread types including 3GPP 
working group four (WG4) defined delay spread channel cases 1, 2 and 3. The simulations 
are set for both chip rate and twice the chip rate sampling. The length of delay spread is 
assumed W= 57. Zero timing error is assumed through the whole simulations. The channel 
impulse response is assumed to be exactly known. In general, the bit error rate (BER) 
performance of the multi-code case is better than its corresponding single-code counterpart 



-22- 




I-2-178.3US 



in the simulation. For the particular example used in the simulation, single-code 

transmission uses 1 6 resource units per tinie slot while the multi-code transmission uses only 

12 resource units in each time slot. Using only 12 codes produces less interference and 

therefore better BER. As compared with BLE-JD, only Uttle or limited performance 

degradation are observed for proposed algorithm based on FFT decomposition of the 

channel correlation matrix (FFT-R) in both single-code and multi-code cases. In single-code 

case, the FFT-R based approach is identical to the block linear equalization structure. The 

proposed FFT-R based approach and the approach based on FFT of die channel response 

matrix (FFT-H) are identical to each other at the chip rate sampHng. 

\^ [0080] The performance of low complexity data detection using FFT-R and FFT-H 
□ 

□ is compared to an ideal single user bond, a worst case matched filtering, BLE-JD and single 
user detection with BLE using an approximate Cholesky decomposition. For the working 

Q points of interest, the BER range was typically between 1 % and 10%. Only a httie or limited 

^ signal to noise ratio (SNR) performance degradations are observed for low complexity data 

P detection as compared with BLE-JD, and significant SNR performance enhancement over 

H matched filtering (MF). Low complexity data detection also performs well in an additive 

m 

O white gaussian noise (AWGN) channel environment. Figures 5-15 show that low 



complexity data detection offers very comparable performance in BER or SNR at much 
lower complexity and power consumption as compared to BLE-JD using approximate 
Cholesky decomposition. 
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